Network Summarization with Preserved Spectral Properties
Abstract
Large-scale networks are widely used to represent relationships between objects in many real-world applications. Their size makes them computationally expensive to process, analyze, and mine for information. Network summarization techniques are commonly used to reduce this computational load while attempting to maintain the basic structural properties of the original network. Previous work has focused primarily on network partitioning strategies with application-dependent regularization, most often resulting in strongly connected clusters. In this paper, we introduce a novel perspective on the network summarization problem based on concepts from spectral graph theory. We propose a new distance measure that characterizes the spectral differences between the original and coarsened networks. We rigorously justify this spectral distance using the interlacing theorem as well as results from the stochastic block model. We provide an efficient algorithm that generates coarsened networks which maximally preserve the spectral properties of the original network. Unlike previous approaches, our framework can generate a set of coarsened networks with significantly different structures, each preserving a different aspect of the original network. We conduct extensive experiments on a variety of large-scale networks, both from real-world applications and from the random graph model. We show that our proposed algorithms consistently achieve better spectral measurements and running times than previous network summarization algorithms.
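The abstract does not define the proposed spectral distance or the coarsening algorithm, so the following is only a minimal sketch of the interlacing property it appeals to, not the paper's method. It assumes a coarsening given by a node partition and uses the standard quotient construction B = SᵀAS, where the columns of S are unit-norm group indicators; the function names `coarsen` and `interlaces` and the toy graph are hypothetical.

```python
import numpy as np


def coarsen(adj, labels):
    """Return S^T A S, where S has orthonormal columns built from the partition.

    Assumption: this quotient construction is illustrative only and is not
    taken from the paper.
    """
    labels = np.asarray(labels)
    groups = np.unique(labels)
    # Column j of S is the indicator vector of group j, scaled to unit length.
    S = np.zeros((adj.shape[0], groups.size))
    for j, g in enumerate(groups):
        members = labels == g
        S[members, j] = 1.0 / np.sqrt(members.sum())
    return S.T @ adj @ S


def interlaces(adj, coarse, tol=1e-9):
    """Check lambda_i(A) <= mu_i(B) <= lambda_{n-k+i}(A), eigenvalues ascending."""
    lam = np.linalg.eigvalsh(adj)     # n eigenvalues of the original matrix
    mu = np.linalg.eigvalsh(coarse)   # k eigenvalues of the coarsened matrix
    n, k = lam.size, mu.size
    lower_ok = np.all(lam[:k] <= mu + tol)
    upper_ok = np.all(mu <= lam[n - k:] + tol)
    return bool(lower_ok and upper_ok)


# Toy graph (hypothetical): two triangles joined by the single edge (2, 3).
A = np.zeros((6, 6))
for u, v in [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]:
    A[u, v] = A[v, u] = 1.0

# Merge each triangle into one supernode.
B = coarsen(A, [0, 0, 0, 1, 1, 1])
print(B)
print(interlaces(A, B))
```

Because S has orthonormal columns, the eigenvalues of the coarsened matrix are guaranteed to interlace those of the original for any partition; how tightly they track the original spectrum depends on the partition chosen, which is the kind of difference a spectral distance would quantify.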
Similar resources
An efficient method for cloud detection based on the feature-level fusion of Landsat-8 OLI spectral bands in deep convolutional neural network
Cloud segmentation is a critical pre-processing step for any multi-spectral satellite image application. In particular, disaster-related applications e.g., flood monitoring or rapid damage mapping, which are highly time and data-critical, require methods that produce accurate cloud masks in a short time while being able to adapt to large variations in the target domain (induced by atmospheric c...
Spectral Estimation of Printed Colors Using a Scanner, Conventional Color Filters and applying backpropagation Neural Network
Reconstructing the spectral data of color samples using conventional color devices such as a digital camera or scanner is always of interest. Nowadays, multispectral imaging has introduced a feasible method to estimate the spectral reflectance of the images utilizing more than three-channel imaging. The goal of this study is to spectrally characterize a color scanner using a set of conventional...
Quantifying loss of information in network-based dimensionality reduction techniques
To cope with the complexity of large networks, a number of dimensionality reduction techniques for graphs have been developed. However, the extent to which information is lost or preserved when these techniques are employed has not yet been clear. Here we develop a framework, based on algorithmic information theory, to quantify the extent to which information is preserved when network motif ana...
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
Simultaneous Clustering and Noise Detection for Theme-based Summarization
Multi-document summarization aims to produce a concise summary that contains salient information from a set of source documents. Since documents often cover a number of topical themes with each theme represented by a cluster of highly related sentences, sentence clustering plays a pivotal role in theme-based summarization. Moreover, noting that real-world datasets always contain noises which ine...
Journal: CoRR
Volume: abs/1802.04447
Publication date: 2018